Accent Recognition with Hybrid Phonetic Features
نویسندگان
چکیده
The performance of voice-controlled systems is usually influenced by accented speech. To make these more robust, frontend accent recognition (AR) technologies have received increased attention in recent years. As a high-level abstract feature that has profound relationship with language knowledge, AR challenging than other language-agnostic audio classification tasks. In this paper, we use an auxiliary automatic speech (ASR) task to extract language-related phonetic features. Furthermore, propose hybrid structure incorporates the embeddings both fixed acoustic model and trainable model, making robust. We conduct several experiments on AESRC dataset. results demonstrate our approach can obtain 8.02% relative improvement compared Transformer baseline, showing merits proposed method.
منابع مشابه
Dialect and Accent Recognition Using Phonetic-Segmentation Supervectors
We describe a new approach to automatic dialect and accent recognition which exceeds state-of-the-art performance in three recognition tasks. This approach improves the accuracy and substantially lower the time complexity of our earlier phoneticbased kernel approach for dialect recognition. In contrast to state-of-the-art acoustic-based systems, our approach employs phone labels and segmentatio...
متن کاملSyllable-level desynchronisation of phonetic features for speech recognition
This paper describes a novel approach to speech recognition which is based on phonetic features as basic recognition units and the delayed synchronisation of these features within a higher-level prosodic domain, viz. the syllable. The object of this approach is to avoid a rigid segmentation of the speech signal as it is usually carried out by standard segment-based recognition systems. The arch...
متن کاملSegmentation and recognition of phonetic features in handwritten Pitman shorthand
There is a wish to be able to enter text into mobile computing devices at the speed of speech. Only handwritten shorthand schemes can achieve this data recording rate. A new, overall solution to the segmentation and recognition of phonetic features in Pitman shorthand is proposed in this paper. Approaches to the recognition of consonant outlines, vowel and diphthong symbols and shortforms, whic...
متن کاملSpectral and temporal modulation features for phonetic recognition
Recently, the modulation spectrum has been proposed and found to be a useful source of speech information. The modulation spectrum represents longer term variations in the spectrum and thus implicitly requires features extracted from much longer speech segments compared to MFCCs and their delta terms. In this paper, a Discrete Cosine Transform (DCT) analysis of the log magnitude spectrum combin...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Sensors
سال: 2021
ISSN: ['1424-8220']
DOI: https://doi.org/10.3390/s21186258